NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Learning ecosystem-scale dynamics from microbiome data with MDSINE2

https://doi.org/10.1038/s41564-025-02112-6

Gibson, Travis_E; Kim, Younhun; Acharya, Sawal; Kaplan, David_E; DiBenedetto, Nicholas; Lavin, Richard; Berger, Bonnie; Allegretti, Jessica_R; Bry, Lynn; Gerber, Georg_K (September 2025, Nature Microbiology)

Abstract Although dynamical systems models are a powerful tool for analysing microbial ecosystems, challenges in learning these models from complex microbiome datasets and interpreting their outputs limit use. We introduce the Microbial Dynamical Systems Inference Engine 2 (MDSINE2), a Bayesian method that learns compact and interpretable ecosystems-scale dynamical systems models from microbiome timeseries data. Microbial dynamics are modelled as stochastic processes driven by interaction modules, or groups of microbes with similar interaction structure and responses to perturbations, and additionally, noise characteristics of data are modelled. Our open-source software package provides multiple tools for interpreting learned models, including phylogeny/taxonomy of modules, and stability, interaction topology and keystoneness. To benchmark MDSINE2, we generated microbiome timeseries data from two murine cohorts that received faecal transplants from human donors and were then subjected to dietary and antibiotic perturbations. MDSINE2 outperforms state-of-the-art methods and identifies interaction modules that provide insights into ecosystems-scale interactions in the gut microbiome.
more » « less
virDTL: Viral Recombination Analysis Through Phylogenetic Reconciliation and Its Application to Sarbecoviruses and SARS-CoV-2

https://doi.org/10.1089/cmb.2021.0507

Zaman, Sumaira; Sledzieski, Samuel; Berger, Bonnie; Wu, Yi-chieh; Bansal, Mukul S. (January 2023, Journal of Computational Biology)

Full Text Available
Conformational landscape of the yeast SAGA complex as revealed by cryo-EM

https://doi.org/10.1038/s41598-022-16391-0

Vasyliuk, Diana; Felt, Joeseph; Zhong, Ellen D.; Berger, Bonnie; Davis, Joseph H.; Yip, Calvin K. (December 2022, Scientific Reports)

Abstract Spt-Ada-Gcn5-Acetyltransferase (SAGA) is a conserved multi-subunit complex that activates RNA polymerase II-mediated transcription by acetylating and deubiquitinating nucleosomal histones and by recruiting TATA box binding protein (TBP) to DNA. The prototypical yeast Saccharomyces cerevisiae SAGA contains 19 subunits that are organized into Tra1, core, histone acetyltransferase, and deubiquitination modules. Recent cryo-electron microscopy studies have generated high-resolution structural information on the Tra1 and core modules of yeast SAGA. However, the two catalytical modules were poorly resolved due to conformational flexibility of the full assembly. Furthermore, the high sample requirement created a formidable barrier to further structural investigations of SAGA. Here, we report a workflow for isolating/stabilizing yeast SAGA and preparing cryo-EM specimens at low protein concentration using a graphene oxide support layer. With this procedure, we were able to determine a cryo-EM reconstruction of yeast SAGA at 3.1 Å resolution and examine its conformational landscape with the neural network-based algorithm cryoDRGN. Our analysis revealed that SAGA adopts a range of conformations with its HAT module and central core in different orientations relative to Tra1.
more » « less
Full Text Available
Uncovering structural ensembles from single-particle cryo-EM data using cryoDRGN

https://doi.org/10.1038/s41596-022-00763-x

Kinman, Laurel F.; Powell, Barrett M.; Zhong, Ellen D.; Berger, Bonnie; Davis, Joseph H. (November 2022, Nature Protocols)

CryoDRGN is a machine learning system for heterogenous cryo-EM reconstruction of proteins and protein complexes from single particle cryo-EM data. Central to this approach is a deep generative model for heterogeneous cryo-EM density maps, which we empirically find effectively models both discrete and continuous forms of structural variability. Once trained, cryoDRGN is capable of generating an arbitrary number of 3D density maps, and thus interpreting the resulting ensemble is a challenge. Here, we showcase interactive and automated processing approaches for analyzing cryoDRGN results. Specifically, we detail a step-by-step protocol for analysis of the assembling 50S ribosome dataset (Davis et al., EMPIAR-10076), including preparation of inputs, network training, and visualization of the resulting ensemble of density maps. Additionally, we describe and implement methods to comprehensively analyze and interpret the distribution of volumes with the assistance of an associated atomic model. This protocol is appropriate for structural biologists familiar with processing single particle cryo-EM datasets and with moderate experience navigating Python and Jupyter notebooks. It requires 3-4 days to complete.
more » « less
Full Text Available
D-SCRIPT translates genome to phenome with sequence-based, structure-aware, genome-scale predictions of protein-protein interactions

https://doi.org/10.1016/j.cels.2021.08.010

Sledzieski, Samuel; Singh, Rohit; Cowen, Lenore; Berger, Bonnie (October 2021, Cell Systems)

Full Text Available
Topsy-Turvy: integrating a global view into sequence-based PPI prediction

https://doi.org/10.1093/bioinformatics/btac258

Singh, Rohit; Devkota, Kapil; Sledzieski, Samuel; Berger, Bonnie; Cowen, Lenore (June 2022, Bioinformatics)

Abstract SummaryComputational methods to predict protein–protein interaction (PPI) typically segregate into sequence-based ‘bottom-up’ methods that infer properties from the characteristics of the individual protein sequences, or global ‘top-down’ methods that infer properties from the pattern of already known PPIs in the species of interest. However, a way to incorporate top-down insights into sequence-based bottom-up PPI prediction methods has been elusive. We thus introduce Topsy-Turvy, a method that newly synthesizes both views in a sequence-based, multi-scale, deep-learning model for PPI prediction. While Topsy-Turvy makes predictions using only sequence data, during the training phase it takes a transfer-learning approach by incorporating patterns from both global and molecular-level views of protein interaction. In a cross-species context, we show it achieves state-of-the-art performance, offering the ability to perform genome-scale, interpretable PPI prediction for non-model organisms with no existing experimental PPI data. In species with available experimental PPI data, we further present a Topsy-Turvy hybrid (TT-Hybrid) model which integrates Topsy-Turvy with a purely network-based model for link prediction that provides information about species-specific network rewiring. TT-Hybrid makes accurate predictions for both well- and sparsely-characterized proteins, outperforming both its constituent components as well as other state-of-the-art PPI prediction methods. Furthermore, running Topsy-Turvy and TT-Hybrid screens is feasible for whole genomes, and thus these methods scale to settings where other methods (e.g. AlphaFold-Multimer) might be infeasible. The generalizability, accuracy and genome-level scalability of Topsy-Turvy and TT-Hybrid unlocks a more comprehensive map of protein interaction and organization in both model and non-model organisms. Availability and implementationhttps://topsyturvy.csail.mit.edu. Supplementary informationSupplementary data are available at Bioinformatics online.
more » « less
Genetic Neural Networks: an artificial neural network architecture for capturing gene expression relationships

https://doi.org/10.1093/bioinformatics/bty945

Eetemadi, Ameen; Tagkopoulos, Ilias; Berger, Bonnie (November 2018, Bioinformatics)

Full Text Available
RiboProP: a probabilistic ribosome positioning algorithm for ribosome profiling

https://doi.org/10.1093/bioinformatics/bty854

Zhao, Dengke; Baez, William D; Fredrick, Kurt; Bundschuh, Ralf; Berger, Bonnie (October 2018, Bioinformatics)

Full Text Available
OLGA: fast computation of generation probabilities of B- and T-cell receptor amino acid sequences and motifs

https://doi.org/10.1093/bioinformatics/btz035

Sethna, Zachary; Elhanati, Yuval; Callan, Curtis G; Walczak, Aleksandra M; Mora, Thierry; Berger, Bonnie (January 2019, Bioinformatics)

Full Text Available
Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks

https://doi.org/10.1016/j.cels.2017.11.014

Liu, Yang; Palmedo, Perry; Ye, Qing; Berger, Bonnie; Peng, Jian (January 2018, Cell Systems)

Full Text Available

« Prev Next »

Search for: All records